This descriptive data [to complete]
# WHAT ARE THE TOP CITED JOURNALS IN B&P? ------------------------ ---------
top_journals <- data %>% count(Code_Revue, sort = T) %>%
left_join(revues)
## Joining with `by = join_by(Code_Revue)`
# WHAT ARE THE TOP CITED JOURNALS, DISCIPLINES, SPECIALITIES IN B&P? ---------------------------------
top_discipline <- data %>% count(Discipline, sort = T)
top_specialite <- data %>% count(Specialite, sort = T)
top_discipline_top_journaux <- data %>%
filter(Code_Revue %in% top_journals$Code_Revue[1:20]) %>%
count(Discipline, sort = T)
fct_DT <- function(x) {datatable(x,
extensions = c('FixedColumns',"FixedHeader"),
options = list(scrollX = TRUE,
paging=TRUE,
fixedHeader=TRUE))}
fct_percent <- function(x) {
x <- x |> mutate(percent = n/sum(x$n, na.rm = TRUE)*100)
}
list_table <- list(top_journals, top_discipline, top_specialite, top_discipline_top_journaux)
lapply(list_table, fct_percent)
## [[1]]
## # A tibble: 2,360 × 6
## Code_Revue n Abbrev_11 Code_discipline Revue percent
## <chr> <int> <chr> <dbl> <chr> <dbl>
## 1 11767 3270 NATURE 18 NATURE 7.68
## 2 14558 3152 SCIENCE 18 SCIENCE 7.40
## 3 2229 2025 BIOL PHILOS 126 BIOLOGY & PHILOSOPHY 4.75
## 4 12853 1795 PHILOS SCI 126 PHILOSOPHY OF SCIENCE 4.21
## 5 5397 1420 EVOLUTION 17 EVOLUTION 3.33
## 6 13364 1170 P NAS US 18 PROCEEDINGS OF THE NATI… 2.75
## 7 15415 966 SYNTHESE 126 SYNTHESE 2.27
## 8 919 670 AM NATURAL 4 AMERICAN NATURALIST 1.57
## 9 10285 626 J THEOR BIO 12 JOURNAL OF THEORETICAL … 1.47
## 10 2521 563 BR J PHIL S 126 BRITISH JOURNAL FOR THE… 1.32
## # ℹ 2,350 more rows
##
## [[2]]
## # A tibble: 14 × 3
## Discipline n percent
## <chr> <int> <dbl>
## 1 Recherche biomédicale 18016 42.3
## 2 Sciences sociales 7756 18.2
## 3 Biologie 6015 14.1
## 4 Psychologie 3937 9.24
## 5 Médecine clinique 2749 6.45
## 6 Humanités 2651 6.22
## 7 Sciences de la terre et de l'espace 407 0.955
## 8 Santé 252 0.591
## 9 Mathématique 233 0.547
## 10 Physique 193 0.453
## 11 Champs professionnels 157 0.369
## 12 Génie 149 0.350
## 13 Chimie 62 0.146
## 14 Arts 27 0.0634
##
## [[3]]
## # A tibble: 129 × 3
## Specialite n percent
## <chr> <int> <dbl>
## 1 Recherche biomédicale - général 10224 24.0
## 2 Étude sur la science 6083 14.3
## 3 Génétique et hérédité 3682 8.64
## 4 Biochimie et biologie moléculaire 2564 6.02
## 5 Écologie 2551 5.99
## 6 Philosophie 2328 5.46
## 7 Neurologie et neurochirurgie 1459 3.42
## 8 Psychologie expérimentale 1213 2.85
## 9 Zoologie - général 1112 2.61
## 10 Science du comportement 1003 2.35
## # ℹ 119 more rows
##
## [[4]]
## # A tibble: 5 × 3
## Discipline n percent
## <chr> <int> <dbl>
## 1 Recherche biomédicale 11766 59.0
## 2 Sciences sociales 5349 26.8
## 3 Biologie 1823 9.15
## 4 Humanités 509 2.55
## 5 Psychologie 481 2.41
lapply(list_table, fct_DT)
## [[1]]
##
## [[2]]
##
## [[3]]
##
## [[4]]
All the documents of the corpus have a Code_Dicument number. The variable EType_Document has more non-null entries than Type_Document variable.
## Warning: One or more parsing issues, call `problems()` on your data frame for details,
## e.g.:
## dat <- vroom(...)
## problems(dat)
## Rows: 49 Columns: 3
## ── Column specification ────────────────────────────────────────────────────────
## Delimiter: "\t"
## chr (3): Code_Document, EType_Document, Type_Document
##
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
## Warning: One or more parsing issues, call `problems()` on your data frame for details,
## e.g.:
## dat <- vroom(...)
## problems(dat)
## Rows: 21508 Columns: 17
## ── Column specification ────────────────────────────────────────────────────────
## Delimiter: "\t"
## chr (6): OST_BK_Cite, UID, Titre, Page_Debut, Page_Fin, Numero
## dbl (11): OST_BK_Citant, OST_BK, Annee_Bibliographique, Code_Document, Code_...
##
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
## # A tibble: 0 × 17
## # ℹ 17 variables: OST_BK_Cite <chr>, OST_BK_Citant <dbl>, OST_BK <dbl>,
## # Annee_Bibliographique <dbl>, UID <chr>, Code_Document <chr>,
## # Code_Revue <dbl>, Code_Discipline <dbl>, Titre <chr>, Page_Debut <chr>,
## # Page_Fin <chr>, Volume <dbl>, Numero <chr>, Nb_Page <dbl>,
## # Nb_Reference <dbl>, Nb_Auteur <dbl>, Nb_Adresse <dbl>
## New names:
## • `OST_BK` -> `OST_BK...1`
## • `UID` -> `UID...3`
## • `OST_BK` -> `OST_BK...16`
## • `UID` -> `UID...19`
## Warning: One or more parsing issues, call `problems()` on your data frame for details,
## e.g.:
## dat <- vroom(...)
## problems(dat)
## Rows: 2103 Columns: 26
## ── Column specification ────────────────────────────────────────────────────────
## Delimiter: "\t"
## chr (14): OST_BK...1, UID...3, Titre, Page_Debut, Page_Fin, Numero, UID...19...
## dbl (12): Annee_Bibliographique, Code_Document, Code_Revue, Code_Discipline,...
##
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
## Warning: One or more parsing issues, call `problems()` on your data frame for details,
## e.g.:
## dat <- vroom(...)
## problems(dat)
## Rows: 21508 Columns: 17
## ── Column specification ────────────────────────────────────────────────────────
## Delimiter: "\t"
## chr (6): OST_BK_Cite, UID, Titre, Page_Debut, Page_Fin, Numero
## dbl (11): OST_BK_Citant, OST_BK, Annee_Bibliographique, Code_Document, Code_...
##
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
## Warning in left_join(match_data, data_network, by = "OST_BK"): Detected an unexpected many-to-many relationship between `x` and `y`.
## ℹ Row 38 of `x` matches multiple rows in `y`.
## ℹ Row 1157 of `y` matches multiple rows in `x`.
## ℹ If a many-to-many relationship is expected, set `relationship =
## "many-to-many"` to silence this warning.
## Warning: `bindFillRole()` only works on htmltools::tag() objects (e.g., div(),
## p(), etc.), not objects of type 'shiny.tag.list'.
Note that the echo = FALSE parameter was added to the
code chunk to prevent printing of the R code that generated the
plot.